A Vector Space Model for Syntactic Distances Between Dialects

Authors

  • Emanuele Di Buccio
  • Giorgio Maria Di Nunzio
  • Gianmaria Silvello
Abstract

Syntactic comparison across languages is essential in linguistics, e.g. when investigating the relationships among closely related languages. In IR and NLP, syntactic information is used to understand the meaning of word occurrences according to the context in which they appear. In this paper, we discuss a mathematical framework for computing the distance between languages based on the data available in current state-of-the-art linguistic databases. This framework is inspired by approaches presented in IR and NLP.
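
The abstract does not spell out the framework itself, so the following is only a minimal sketch of the general vector-space idea from IR, not the authors' method. It assumes each dialect is encoded as a vector of binary syntactic features (1 if a construction is attested, 0 otherwise) and uses cosine distance as the comparison measure. The dialect names, the feature encoding, and the function name `cosine_distance` are all hypothetical.

```python
import math

def cosine_distance(u, v):
    """Return 1 - cosine similarity of two equal-length feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    # Convention chosen for this sketch: a vector with no attested
    # features is treated as maximally distant from everything.
    if norm_u == 0 or norm_v == 0:
        return 1.0
    return 1.0 - dot / (norm_u * norm_v)

# Hypothetical data: one row per dialect, one column per syntactic feature.
dialects = {
    "dialect_a": [1, 0, 1, 1],
    "dialect_b": [1, 0, 1, 0],
    "dialect_c": [0, 1, 0, 0],
}

# Pairwise distances between all distinct dialect pairs.
names = sorted(dialects)
distances = {
    (x, y): cosine_distance(dialects[x], dialects[y])
    for i, x in enumerate(names)
    for y in names[i + 1:]
}

for pair, dist in distances.items():
    print(pair, round(dist, 3))
```

Under this encoding, dialects that share most attested constructions end up close together, while dialects with no features in common are at distance 1. A real study would need to decide how to weight features and how to handle missing data, which this sketch ignores.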

Similar resources

Syntactic structure and geographical dialects in the songs of male rock hyraxes.

Few mammalian species produce vocalizations that are as richly structured as bird songs, and this greatly restricts the capacity for information transfer. Syntactically complex mammalian vocalizations have been previously studied only in primates, cetaceans and bats. We provide evidence of complex syntactic vocalizations in a small social mammal: the rock hyrax (Procavia capensis: Hyracoidea). ...

Globalization, Standardization, and Dialect Leveling in Iran

This paper is an attempt to shed light on the effects of modernization, urbanization, monolingual educational system, and mass media as well as the process of globalization on dialect leveling among Persian dialects. In so doing, the first part of the paper elaborates on the relationship between globalization and sociolinguistics, and on the concept of standardization. Also, it discusses some ...

Identification of negated regulation events in the literature: exploring the feature space

Background. Regulation events are of critical importance to researchers trying to understand processes in living beings. These events are naturally complex and can involve both individual molecular entities and other biomedical events. Of equal importance is the ability to capture statements that refer to regulation events that do not take place. In this paper we explore the identification of n...

Perceptive evaluation of Levenshtein dialect distance measurements using Norwegian dialect data

The Levenshtein dialect distance method has proven to be a successful method for measuring phonetic distances between Dutch dialects. The aim of the present investigation is to validate the Levenshtein dialect distance with perceptual data from a language area other than the Dutch, namely Norway. We calculate the correlation between the Levenshtein distances and the distances between 15 Norwegi...

Measuring Syntactic Distances between Dialects: A Web Application for Annotating Dialectal Data

• 15:00-16:30 Session Chair: Maurizio Messina, Biblioteca Nazionale Marciana, Venezia • 15:00-16:00 Invited Talk: Sapienza Digital Library Tiziana Catarci, Marco Schaerf Dipartimento di Ingegneria Informatica Automatica e Gestionale “Antonio Ruberti”, Sapienza Università di Roma • 16:00-16:30 Invited Presentation: Digital Cultural Heritage Projects Opportunities and Future Challenges Rossella C...


Publication date: 2014